OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

A Comparison of Some Morphological Filters for Improving OCR Performance

Internal identifier: 000024 (Main/Exploration); previous: 000023; next: 000025

Authors: Laurent Mennillo [France]; Jean Cousty [France]; Laurent Najman [France]

RBID : Hal:hal-01168641

English descriptors

Abstract

The study of discrete space representations has recently led to the development of novel morphological operators. To date, no study has evaluated the performance of these novel operators with respect to a specific application. This article compares the ability of several morphological operators, both old and new, to improve OCR performance when used as preprocessing filters. We design an experiment using the Tesseract OCR engine on binary images degraded with a realistic document-dedicated noise model. We assess the performance of some morphological filters acting in complex, graph and vertex spaces, including the area filters. This experiment reveals the good overall performance of complex and graph filters. Mean squared error (MSE) measurements were also made to evaluate the denoising capability of these filters; they again confirm the performance of both complex and graph filtering in this respect.
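
As a rough illustration of two notions named in the abstract, the sketch below applies a plain pixel-level area filter to a small binary image and computes the MSE against the clean original. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `area_opening` and `mse`, the 4-connectivity, the threshold `min_area` and the toy images are all hypothetical, and the complex and graph filters compared in the article are not reproduced here.

```python
from collections import deque


def area_opening(image, min_area):
    """Keep only 4-connected foreground components with at least min_area pixels.

    `image` is a list of rows containing 0 (background) or 1 (foreground).
    This is an illustrative pixel-space filter, not the article's code.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first search collects one connected component.
            component = []
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                component.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and image[ny][nx] == 1 and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # Components below the area threshold are treated as noise.
            if len(component) >= min_area:
                for y, x in component:
                    out[y][x] = 1
    return out


def mse(a, b):
    """Mean squared error between two images of identical size."""
    total = sum((pa - pb) ** 2 for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return total / (len(a) * len(a[0]))


# Hypothetical toy data: a 3x2 block plus one isolated noise pixel.
clean = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
noisy = [row[:] for row in clean]
noisy[0][4] = 1  # isolated speck in the top-right corner

filtered = area_opening(noisy, min_area=2)
print("MSE before filtering:", mse(noisy, clean))
print("MSE after filtering: ", mse(filtered, clean))
```

An area opening of this kind removes only small foreground specks; small holes in characters would need the dual filter (an area closing), which is omitted here for brevity.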

DOI: 10.1007/978-3-319-18720-4_12


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A Comparison of Some Morphological Filters for Improving OCR Performance</title>
<author>
<name sortKey="Mennillo, Laurent" sort="Mennillo, Laurent" uniqKey="Mennillo L" first="Laurent" last="Mennillo">Laurent Mennillo</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-186882" status="VALID">
<orgName>Institut Pascal [Aubiere]</orgName>
<desc>
<address>
<addrLine>24 avenue des Landais 63171 Aubiere Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ip.univ-bpclermont.fr/index.php/fr/</ref>
</desc>
<listRelation>
<relation name="UMR6602" active="#struct-441569" type="direct"></relation>
<relation active="#struct-205618" type="direct"></relation>
<relation active="#struct-322672" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="UMR6602" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-205618" type="direct">
<org type="institution" xml:id="struct-205618" status="VALID">
<orgName>Université Blaise Pascal - Clermont-Ferrand 2</orgName>
<orgName type="acronym">UBP</orgName>
<desc>
<address>
<addrLine>34, avenue Carnot - BP 185 - 63006 Clermont-Ferrand cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-bpclermont.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-322672" type="direct">
<org type="institution" xml:id="struct-322672" status="VALID">
<orgName>Sigma CLERMONT</orgName>
<orgName type="acronym">Sigma CLERMONT</orgName>
<date type="start">2016-01-01</date>
<desc>
<address>
<addrLine>Sigma CLERMONT, Campus des Cézeaux, CS 20265, 63178 Aubière Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.sigma-clermont.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Cousty, Jean" sort="Cousty, Jean" uniqKey="Cousty J" first="Jean" last="Cousty">Jean Cousty</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-3210" status="VALID">
<idno type="RNSR">200212717U</idno>
<orgName>Laboratoire d'Informatique Gaspard-Monge</orgName>
<orgName type="acronym">LIGM</orgName>
<desc>
<address>
<addrLine>Université de Paris-Est - Marne-la-Vallée, Cité Descartes, Bâtiment Copernic, 5 bd Descartes, 77454 Marne-la-Vallée Cedex 2, Inst Gaspard Monge</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://ligm.u-pem.fr</ref>
</desc>
<listRelation>
<relation active="#struct-301243" type="direct"></relation>
<relation active="#struct-301545" type="direct"></relation>
<relation active="#struct-302085" type="direct"></relation>
<relation active="#struct-304949" type="direct"></relation>
<relation name="UMR8049" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301243" type="direct">
<org type="institution" xml:id="struct-301243" status="VALID">
<orgName>Université Paris-Est Marne-la-Vallée</orgName>
<orgName type="acronym">UPEM</orgName>
<desc>
<address>
<addrLine>5 boulevard Descartes - Champs-sur-Marne - 77454 Marne-la-Vallée Cedex 2</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.u-pem.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-301545" type="direct">
<org type="institution" xml:id="struct-301545" status="OLD">
<orgName>École des Ponts ParisTech (ENPC)</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-302085" type="direct">
<org type="institution" xml:id="struct-302085" status="VALID">
<orgName>Fédération de Recherche Bézout</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-304949" type="direct">
<org type="institution" xml:id="struct-304949" status="INCOMING">
<orgName>ESIEE</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR8049" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Najman, Laurent" sort="Najman, Laurent" uniqKey="Najman L" first="Laurent" last="Najman">Laurent Najman</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-3210" status="VALID">
<idno type="RNSR">200212717U</idno>
<orgName>Laboratoire d'Informatique Gaspard-Monge</orgName>
<orgName type="acronym">LIGM</orgName>
<desc>
<address>
<addrLine>Université de Paris-Est - Marne-la-Vallée, Cité Descartes, Bâtiment Copernic, 5 bd Descartes, 77454 Marne-la-Vallée Cedex 2, Inst Gaspard Monge</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://ligm.u-pem.fr</ref>
</desc>
<listRelation>
<relation active="#struct-301243" type="direct"></relation>
<relation active="#struct-301545" type="direct"></relation>
<relation active="#struct-302085" type="direct"></relation>
<relation active="#struct-304949" type="direct"></relation>
<relation name="UMR8049" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301243" type="direct">
<org type="institution" xml:id="struct-301243" status="VALID">
<orgName>Université Paris-Est Marne-la-Vallée</orgName>
<orgName type="acronym">UPEM</orgName>
<desc>
<address>
<addrLine>5 boulevard Descartes - Champs-sur-Marne - 77454 Marne-la-Vallée Cedex 2</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.u-pem.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-301545" type="direct">
<org type="institution" xml:id="struct-301545" status="OLD">
<orgName>École des Ponts ParisTech (ENPC)</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-302085" type="direct">
<org type="institution" xml:id="struct-302085" status="VALID">
<orgName>Fédération de Recherche Bézout</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-304949" type="direct">
<org type="institution" xml:id="struct-304949" status="INCOMING">
<orgName>ESIEE</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR8049" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01168641</idno>
<idno type="halId">hal-01168641</idno>
<idno type="halUri">https://hal.archives-ouvertes.fr/hal-01168641</idno>
<idno type="url">https://hal.archives-ouvertes.fr/hal-01168641</idno>
<idno type="doi">10.1007/978-3-319-18720-4_12</idno>
<date when="2015-05-27">2015-05-27</date>
<idno type="wicri:Area/Hal/Corpus">000003</idno>
<idno type="wicri:Area/Hal/Curation">000003</idno>
<idno type="wicri:Area/Hal/Checkpoint">000012</idno>
<idno type="wicri:Area/Main/Merge">000024</idno>
<idno type="wicri:Area/Main/Curation">000024</idno>
<idno type="wicri:Area/Main/Exploration">000024</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">A Comparison of Some Morphological Filters for Improving OCR Performance</title>
<author>
<name sortKey="Mennillo, Laurent" sort="Mennillo, Laurent" uniqKey="Mennillo L" first="Laurent" last="Mennillo">Laurent Mennillo</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-186882" status="VALID">
<orgName>Institut Pascal [Aubiere]</orgName>
<desc>
<address>
<addrLine>24 avenue des Landais 63171 Aubiere Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ip.univ-bpclermont.fr/index.php/fr/</ref>
</desc>
<listRelation>
<relation name="UMR6602" active="#struct-441569" type="direct"></relation>
<relation active="#struct-205618" type="direct"></relation>
<relation active="#struct-322672" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="UMR6602" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-205618" type="direct">
<org type="institution" xml:id="struct-205618" status="VALID">
<orgName>Université Blaise Pascal - Clermont-Ferrand 2</orgName>
<orgName type="acronym">UBP</orgName>
<desc>
<address>
<addrLine>34, avenue Carnot - BP 185 - 63006 Clermont-Ferrand cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-bpclermont.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-322672" type="direct">
<org type="institution" xml:id="struct-322672" status="VALID">
<orgName>Sigma CLERMONT</orgName>
<orgName type="acronym">Sigma CLERMONT</orgName>
<date type="start">2016-01-01</date>
<desc>
<address>
<addrLine>Sigma CLERMONT, Campus des Cézeaux, CS 20265, 63178 Aubière Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.sigma-clermont.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Cousty, Jean" sort="Cousty, Jean" uniqKey="Cousty J" first="Jean" last="Cousty">Jean Cousty</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-3210" status="VALID">
<idno type="RNSR">200212717U</idno>
<orgName>Laboratoire d'Informatique Gaspard-Monge</orgName>
<orgName type="acronym">LIGM</orgName>
<desc>
<address>
<addrLine>Université de Paris-Est - Marne-la-Vallée, Cité Descartes, Bâtiment Copernic, 5 bd Descartes, 77454 Marne-la-Vallée Cedex 2, Inst Gaspard Monge</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://ligm.u-pem.fr</ref>
</desc>
<listRelation>
<relation active="#struct-301243" type="direct"></relation>
<relation active="#struct-301545" type="direct"></relation>
<relation active="#struct-302085" type="direct"></relation>
<relation active="#struct-304949" type="direct"></relation>
<relation name="UMR8049" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301243" type="direct">
<org type="institution" xml:id="struct-301243" status="VALID">
<orgName>Université Paris-Est Marne-la-Vallée</orgName>
<orgName type="acronym">UPEM</orgName>
<desc>
<address>
<addrLine>5 boulevard Descartes - Champs-sur-Marne - 77454 Marne-la-Vallée Cedex 2</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.u-pem.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-301545" type="direct">
<org type="institution" xml:id="struct-301545" status="OLD">
<orgName>École des Ponts ParisTech (ENPC)</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-302085" type="direct">
<org type="institution" xml:id="struct-302085" status="VALID">
<orgName>Fédération de Recherche Bézout</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-304949" type="direct">
<org type="institution" xml:id="struct-304949" status="INCOMING">
<orgName>ESIEE</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR8049" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Najman, Laurent" sort="Najman, Laurent" uniqKey="Najman L" first="Laurent" last="Najman">Laurent Najman</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-3210" status="VALID">
<idno type="RNSR">200212717U</idno>
<orgName>Laboratoire d'Informatique Gaspard-Monge</orgName>
<orgName type="acronym">LIGM</orgName>
<desc>
<address>
<addrLine>Université de Paris-Est - Marne-la-Vallée, Cité Descartes, Bâtiment Copernic, 5 bd Descartes, 77454 Marne-la-Vallée Cedex 2, Inst Gaspard Monge</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://ligm.u-pem.fr</ref>
</desc>
<listRelation>
<relation active="#struct-301243" type="direct"></relation>
<relation active="#struct-301545" type="direct"></relation>
<relation active="#struct-302085" type="direct"></relation>
<relation active="#struct-304949" type="direct"></relation>
<relation name="UMR8049" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301243" type="direct">
<org type="institution" xml:id="struct-301243" status="VALID">
<orgName>Université Paris-Est Marne-la-Vallée</orgName>
<orgName type="acronym">UPEM</orgName>
<desc>
<address>
<addrLine>5 boulevard Descartes - Champs-sur-Marne - 77454 Marne-la-Vallée Cedex 2</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.u-pem.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-301545" type="direct">
<org type="institution" xml:id="struct-301545" status="OLD">
<orgName>École des Ponts ParisTech (ENPC)</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-302085" type="direct">
<org type="institution" xml:id="struct-302085" status="VALID">
<orgName>Fédération de Recherche Bézout</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-304949" type="direct">
<org type="institution" xml:id="struct-304949" status="INCOMING">
<orgName>ESIEE</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR8049" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
</analytic>
<idno type="DOI">10.1007/978-3-319-18720-4_12</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>Character recognition</term>
<term>Graphs</term>
<term>Morphological filtering</term>
<term>Simplicial complexes</term>
<term>Vertex</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The study of discrete space representations has recently led to the development of novel morphological operators. To date, no study has evaluated the performance of these novel operators with respect to a specific application. This article compares the ability of several morphological operators, both old and new, to improve OCR performance when used as preprocessing filters. We design an experiment using the Tesseract OCR engine on binary images degraded with a realistic document-dedicated noise model. We assess the performance of some morphological filters acting in complex, graph and vertex spaces, including the area filters. This experiment reveals the good overall performance of complex and graph filters. Mean squared error (MSE) measurements were also made to evaluate the denoising capability of these filters; they again confirm the performance of both complex and graph filtering in this respect.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
</list>
<tree>
<country name="France">
<noRegion>
<name sortKey="Mennillo, Laurent" sort="Mennillo, Laurent" uniqKey="Mennillo L" first="Laurent" last="Mennillo">Laurent Mennillo</name>
</noRegion>
<name sortKey="Cousty, Jean" sort="Cousty, Jean" uniqKey="Cousty J" first="Jean" last="Cousty">Jean Cousty</name>
<name sortKey="Najman, Laurent" sort="Najman, Laurent" uniqKey="Najman L" first="Laurent" last="Najman">Laurent Najman</name>
</country>
</tree>
</affiliations>
</record>
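
For readers who want to pull the title and author list out of a record like the one above, here is a minimal Python sketch using the standard `xml.etree.ElementTree` module. The `SAMPLE` string is a hypothetical, trimmed-down copy of the record: the full record uses `wicri:` and `hal:` prefixes whose namespace declarations do not appear on this page, so a strict XML parser would reject it unless those prefixes were declared or removed first.

```python
import xml.etree.ElementTree as ET

# Hypothetical reduced sample: only the title and author names are kept,
# and the undeclared wicri:/hal: prefixed content is left out.
SAMPLE = """<record><TEI><teiHeader><fileDesc><titleStmt>
<title xml:lang="en">A Comparison of Some Morphological Filters for Improving OCR Performance</title>
<author><name first="Laurent" last="Mennillo">Laurent Mennillo</name></author>
<author><name first="Jean" last="Cousty">Jean Cousty</name></author>
<author><name first="Laurent" last="Najman">Laurent Najman</name></author>
</titleStmt></fileDesc></teiHeader></TEI></record>"""

root = ET.fromstring(SAMPLE)

# The title statement sits at a fixed path below the record root.
title_stmt = root.find("./TEI/teiHeader/fileDesc/titleStmt")
title = title_stmt.findtext("title")

# Build "Last, First" strings from the attributes of each <name> element.
authors = [
    name.get("last") + ", " + name.get("first")
    for name in title_stmt.iter("name")
]

print(title)
for author in authors:
    print(author)
```

Restricting the search to `titleStmt` avoids counting each author twice, since the same names are repeated under `sourceDesc` and `affiliations` in the full record.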

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000024 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000024 | SxmlIndent | more

To add a link to this page in the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:hal-01168641
   |texte=   A Comparison of Some Morphological Filters for Improving OCR Performance
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024